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Selecting Multiple Paths in Overlay Networks for Streaming Data 
Field of the Invention 

[0 1 ] This invention relates to communications networks, and more specifically 
to selecting multiple communication paths for streaming data in heterogeneous 
networks including physical and overlay networks. 

Background of the Invention 

[02] It is desired to stream data over the Internet in a reliable and cost effective 
manner. Typically, the streamed data are a video and/or an audio signal. Streaming 
data can be characterized by strict delay constraints, and sensitivity to packet losses. 
The current infrastructure of packet networks, such as the Internet, is insufficient to 
adequately support streaming applications. The quality of end-to-end streaming can 
be improved by using special coding techniques, such as multiple description coding 
(MDC) or fine-granularity-scalability (FGS) coding. 

[03] These coding techniques work particularly well when multiple paths are 
used for the description, see Apostolopoulos, "Reliable video communication over 
lossy packet networks using multiple state encoding and path diversity," in VCIP, 
2001, and Lin et al, "Video Transport over Ad Hoc Networks Using Multiple 
Paths," Proceedings of The 2002 IEEE International Symposium on Circuit and 
Systems, May 2002, and Zhou et al., "Multi-path transport of FGS video," Packet 
Video Workshop, 2003, Begen et al, "Multi-path selection for multiple description 
encoded video streaming," in IEEE ICC, 2003, and Begen et al., in "Fast Heuristics 
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for multi-path selection for multiple description encoded video streaming," IEEE 
ICME, 2003. 

[04] Content delivery networks (CDN) have been used to provide path diversity 
for MDC video streaming, see Apostolopoulos et al., "On multiple description 
streaming with content delivery networks," IEEE INFOCOM, 2002. However, that 
method requires that videos are pre-stored in edge servers of the network. Edge 
servers are located near the destination clients. This increases the cost of the 
streaming. 

[05] Prior art path selection usually requires that all links in the selected paths 
are disjoint. That is, the paths are 'uncorrelated'. However, techniques based on 
maximally link-disjoint paths severely restrict the number of available paths, and on 
the average tend to be longer, increasing cost, latency, and packet losses. 

Summary of the Invention 

[06] A method selects multiple paths between a server and a client in an 
overlay network where nodes are connected by links. 

[07] The nodes include the server and the client. Each path includes a set of 
selected links. First, in each node, quality of service metrics are measured for each 
link directly connecting the node to an immediate neighboring node. The metrics 
can include bandwidth, latency, and packet loss rate. 
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[08] The metrics are transmitted to the server. In the server, a link correlation 
matrix based on the metrics and a path correlation matrix based on the link 
correlation matrix are determined. 

[09] Then, the multiple paths are selected based only on the metrics, the link 
correlation matrix, and the path correlation matrix. At that point, data can be 
streamed from the server to the client. 

Brief Description of the Drawings 

[010] Figure 1 is a diagram of a heterogeneous network according to the 
invention; and 

[Oil] Figure 2 is a block diagram of a method for selecting multiple paths in an 
overlay network according to the invention. 

Detailed Description of the Preferred Embodiment 

[012] Figure 1 shows a heterogeneous network 100 that uses a method for 
selecting multiple paths for streaming data according to our invention. Our network 
100 includes an overlay network 1 10 and a physical network 120. Overlay nodes 
1 1 1 of the overlay network, shown as open circles, are connected by overlay links 
1 12, and physical nodes 121 of the physical network, shown as solid circles, are 
connected by physical links 122. 
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[013] The overlay network 1 1 0 is a virtual network built on top of an existing 
physical network, e.g., the Internet, see Andersen et al., "Resilient overlay 
networks," 18 th ACM Symposium on Operation Systems Principles, 2001. The 
vertical lines 105 indicate connections between the nodes of the overlay and 
physical networks. As an advantage, our overlay network 1 10 requires minimal 
support and knowledge of the underlying physical network 120. 

[014] The overlay network 1 10 includes a server node 101 as a source for the 
data to be streamed, and a client node 102 as a destination for the streamed data. The 
source and destination nodes are connected by intermediate nodes 103 and the 
overlay links 1 12. A set of links that provides a complete end-to-end connection 
between the server and client is called a path. A number of different paths are 
possible. 

[015] It is an object of the invention to determine multiple optimal paths from 
the server to the client. It is another object of the invention to maintain such optimal 
paths dynamically as the network conditions and traffic patterns change over time. 

[0 1 6] Our overlay network 1 1 0 makes minimal assumptions about the 
knowledge and support of the underlying physical network 120. Periodically, the 
client 102 makes a request to receive streamed data from the server 102. When the 
server receives such a request, the server dynamically selects multiple paths to 
deliver the streamed data the client using paths in the overlay network. By using 
multiple paths as selected by the invention, the quality of service (QoS) is improved 
over the prior art. 
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[017] In the prior art, improved service can be attained by storing a copy of the 
desired video in servers at the edges of the network. That is, the servers are near the 
clients. However, videos consume enormous amounts of storage, thus the cost of the 
overall system is increased dramatically if multiple edge servers are used to store 
copies of all desired videos. In contrast, the invention only stores one copy of the 
video at the originating server, no matter how far removed that server is from the 
client. 

[018] According to the invention, each node 1 1 1 in the overlay network 1 10 
monitors QoS metrics of the direct links 1 12 to immediate neighboring nodes, for 
example, bandwidth, latency, and packet loss rates. These metrics, for each link are 
forwarded to the server 101. The server uses the metrics to select multiple optimal 
paths from the server to the client in a dynamic manner, as described in detail below. 

[0 1 9] The problem with prior art path selection methods is that they require 
paths with disjoint sets of links. That is, the link cannot be part of more than one 
overlay path. There are several problems with that selection method. First, the 
selection is based primarily on network topology as opposed to network condition. 
This is shown in the example network of Figure 1. Here overlay links 1 13-1 14 are 
different but they are implemented, in part, by a common link 125. It is not possible 
to select multiple paths for the network 100 using prior art techniques, such as a 
maximally link-disjoint path method. Second, that method severely limits the 
number of paths that can be selected. This has the effect that paths, on the average, 
tend to be longer and more costly than needed. Longer paths tend to have lower 
performance. 



5 



Merl-1502 
Shaoetal. 



[020] Our solution to this problem is twofold. First, we use a QoS metric based 
on both link correlation and path correlation for the multi-path selection problem. 
Second, we provide a cost function for the link and path correlations to select 
multiple paths in an optimal manner. Our invention recognizes that many links in 
the backbone of the physical network have very high bandwidths and are extremely 
reliable. In fact, the QoS metrics for such links are constantly improving over time. 
Therefore, in a practical application, it is quite conceivable, and perhaps 
advantageous to share such a high performance physical link among multiple 
overlay paths. 

[021] Correlation Matrices 

[022] In our overlay network 1 10, the server 101 maintains a link correlation 
matrix 150 for all links and a path correlation 151 matrix for all paths. The link 
correlation matrix relates each overlay link to all other overlay links based on QoS 
metrics 152 also maintained by the server. Similarly, the path correlation matrix 
relates each possible path to all other possible paths. 

[023] Link Correlation 

[024] Unlike the prior art, we make no assumptions about a state or topology of 
the underlying physical network 120. At the server 101, we use statistical 
information 152 of the links 1 12 collected by the nodes 1 1 1 to define the link and 
path correlations 150-152. Each node collects metrics about the direct links that 
connect that node to each immediate neighboring node. 
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[025] The link correlations (O) 150 based on the metrics L are: 

Cr(L in L mn ) = 1 /2 + — . (1) 

where (/ and mw are a pair of links, E is an expectation for the links. This formula is 
based on the standard definition for a linear correlation between two random vectors, 
where the covariance is divided by the variance. Normally, the expectations are in 
the range [-1, +1]. However, we use a scaled form in equation (1), i.e., QA + l A [.]). 

This guarantees that O^G^/J^l. The values Ly and L mn are metrics for link ij 

and link mn. The metric can consider, for example, bandwidth, latency, and packet 
loss rate. Here, an average of a metric is L u = E(L b ) . 



[026] All the expectations and CriL ip L mt ) can be measured dynamically and 

periodically over some time window so that the information 150-152 at the server as 
always current when paths need to be selected. Furthermore, should network 
conditions change while the data are streamed, for example, the condition of a path 
deteriorates, then a better path can be selected so that data are always streamed in a 
most effective manner. 



[027] As stated above, each overlay node probes the links connecting the node to 
neighboring overlay nodes to obtain the link information 152. This information can 
be periodically updated and transmitted to the server with usual routing packets. The 
server sets a time window size and also periodically update the correlation matrices 
150-151. 
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[028] Path Correlation Model 

[029] The path correlation matrix 151 is also based on the QoS metrics 152. 
Path A and path B are two paths in the overlay network with a link set a e A and a link 
set b e B. The correlation matrix for these two paths is defined as: 
Cr{Path A ,Path ,) = £ £ Cr(a,b) (2) 

aeA beB 

[030] This correlation matrix together with the measured QoS metrics 152, such 
as latency, packet loss rate and bandwidth, is used to select the multiple paths 
between the 101 server and the client 102 for streaming data, e.g., a video. 

[031] Minimal Correlation Multi-Path Selection Problem 

[032] Our path selection method can be formulated as an optimization problem. 
In the following descriptions, we select two paths. However, it should be understood 
that more than two paths can be selected using our method. 

[033] In our multi-path selection method, the overlay network 1 10 is represented 
by a graph g = (V, £), where V is the set of nodes and £ is the set of links. Each 
link (ij) € £ is associated with R non-negative and additive QoS values 
W r (iJ\r = 1,2,...,/? 152. The additive QoS metric means that a path metric is a 
summation of the metrics of the set of links £ that define the path. Examples of 
additive QoS metrics are latency, delay jitter, and bandwidth. Some non-additive 
QoS metric, such as packet loss rate can be transformed to an additive metric using a 
conventional logarithmic function, see Zheng et al., "Quality of service routing for 
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supporting multimedia applications," IEEE Journal of Selected Areas in 
Communications, Vol 14, No. 7, pp. 1228-1234, 1996, for details. 



[034] In addition, there are R corresponding path QoS constraints Dr, r = 1, 2, . . ., 
R, as defined below. A correlation between link (ij ) and (m, n) is defined by 
equation (1) as Cr (Ly, L mn ). 

[035] We also define link validity indication vectors X = (...Xy. . .) and 

Y=(... Yy. . .). These vectors indicate whether a link is valid in a first path (Xy = 

1), or in a second path (Yy= 1). 



[036] Then, we minimize 
I Y. X ij- Y m n Cr(L ij ,L mn \ (3) 

(f',y')e£ (m,n)eE 

subject to link validities 

X y z {0,\}, Y mn e {0,1} y{i,j)*E, 

and the following constraints 
y Zx y W r (i,j)<D r ,Vr = l,2,...,R 



Z - I r~ = 

(m ,n )e £ (n ,m )€ £ 



(4) 



1, if i = s 
-\,ifi = d (5) 
0, e/se 

1 , i/ W = 5 

-l,tf m=d . (6) 
0, efee 
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[037] Equation (4) expresses the R different QoS metric constraints W 152, 
while equations (5) and (6) are link constraints for the first and second path, 
respectively. In equations (5) and (6), the symbols s and d indicate the server (source 
node) and client (destination node), respectively. 



[038] This minimization is an NP-hard problem. The matrix Qpm has 



C r (Lij>L mn ) entries. We can define vectors P 2 | £|xl = and D 2 \e\*2*\e\ = 



v 



0 C 
0 



and change the corresponding constraints so that the minimization objective 
becomes -P T DP, where T denotes an inverse transform in the conventional manner. 

2 

This is a standard format of well known quadratic programming which is a NP-hard 
problem. 



[039] Multi-Path Selection Method 

[040] As shown in Figure 2, the NP-hardness of our multi-path selection 
problem requires us to provide a heuristic solution. In step 0, we invalidate all links 
in the overlay network with available bandwidths that are less than a target 
streaming rate. We measure the QoS metrics for the links, and construct and 
maintain the link and path correlation matrices as described above. 



[041] In step 1, we first select a 'shortest' path based on predetermined QoS 
metrics, e.g., latency and packet loss rate. 
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[042] In step 2, we update the available bandwidth of each link according to 
previously selected paths. Again, we invalidate any links with a bandwidth less 
than the target streaming rate. 

[043] In step 3, we take into account the correlation between links. Therefore, 
we determine a correlation cost (cc) for each link L with respect to a previous 
selected link set S of a path as 

Cr s L = Y,Cr(L,a), (7) 

aeS 

where L e £, and a e S, as defined above. The correlated cost of link L = (i J) 
with the selected set S is a summation of all the pair wise correlation between L 
and every link in the set S. 

[044] Then, in step 4, we combine the correlation cost and the R measured 
metrics Wto obtain a new cost for each link using a cost function 

Cost L s = a ■ Cr L s +f j a l W r {i,j). (8) 

This cost function is a weighted sum of the path correlation matrix 151 and the 
metric W 152, where a and a , are weighting factors. 

[045] In step 5, we find a next shortest path based on the usual link metrics, 
such as bandwidth, latency and packet loss rate. We use the correlation cost as 
another constraint to determine the next shortest path. To find the one path that 
satisfies the multiple constraints, we use the cost function of equation (8) that 
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combines the correlation cost and the most important link metrics, such as packet 
loss rate and latency, using the appropriate weighting factors. 

[046] We use the well known Dijkstra algorithm to find the next shortest path 
according to the new cost. The Dijkstra algorithm is used for link-state routing 
protocols to help find the shortest path through a mesh of network connections 
based on the path with the least cost. The algorithm performs a series of 
calculations that eventually develop the cost of pathways to nodes, and the 
pathway that has the least cost, in a well known manner. 

[047] Because finding a path with multiple constraints is known as a NP-hard 
problem, we use equation (8) to improve performance. Various approximation 
algorithms for this problem are also known. We can replace steps 4 and 5 by such 
known approximations. 

[048] Steps 2 through 5 are repeated until all paths have been found. 

[049] In a dynamic version of the selection method, all links are reevaluated, 
and new paths are selected depending on current and dynamic network conditions 
by repeating with step 0. 

Effect of the Invention 

[050] The invention provides a multi-path selection method for streaming 
multiple description coded (MDC) video in an overlay network of the Internet. We 
provide a QoS metric and link correlation matrix. Based on the link correlation 
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matrix, the invention provides a path correlation matrix for the multi-path selection 
problem of overlay network. The invention also provides an efficient multi-path 
selection procedure, called correlation cost routing, to find multiple paths in the 
overlay network. The invention improves performance by up to 3dB when compared 
with prior art maximally link-disjoint paths for streamed MDC video. 

[05 1] The prior art selection method based on maximally link-disjoint paths 
does not accurately specify the correlation between paths in the overlay network, 
and thus has a lower PSNR. Our correlation cost routing procedure incorporates 
both the static and the dynamic correlations between overlay links into the path 
selection method, and thus, has a better PSNR. Our method allows a link to be 
used in more than one path and thus, is more likely to meet delay constraint than 
the maximally link-disjoint paths method. Indeed, our method has a lower packet 
loss rate due to path diversity, and our method has fewer delayed packets due to 
shorter path length, and is less complex to implement. 

[052] As a most important difference, the prior art first considers the network 
topology to select viable paths, and only then considers path QoS metrics. In 
contrast, the invention is entirely based on correlations of QoS metrics of links and 
paths. Network topology is not a factor. 

[053] Although the invention has been described by way of examples of 
preferred embodiments, it is to be understood that various other adaptations and 
modifications can be made within the spirit and scope of the invention. Therefore, 
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it is the object of the appended claims to cover all such variations and 
modifications as come within the true spirit and scope of the invention. 
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